Pivot Language
   HOME

TheInfoList



OR:

A pivot language, sometimes also called a bridge language, is an
artificial Artificiality (the state of being artificial or manmade) is the state of being the product of intentional human manufacture, rather than occurring naturally through processes not involving or requiring human activity. Connotations Artificiality ...
or
natural language In neuropsychology, linguistics, and philosophy of language, a natural language or ordinary language is any language that has evolved naturally in humans through use and repetition without conscious planning or premeditation. Natural languages ...
used as an intermediary language for translation between many different languages – to translate between any pair of languages A and B, one translates A to the pivot language P, then from P to B. Using a pivot language avoids the
combinatorial explosion In mathematics, a combinatorial explosion is the rapid growth of the complexity of a problem due to how the combinatorics of the problem is affected by the input, constraints, and bounds of the problem. Combinatorial explosion is sometimes used ...
of having translators across every combination of the supported languages, as the number of combinations of language is linear (n-1), rather than quadratic \left(\textstyle=\frac\right) – one need only know the language A and the pivot language P (and someone else the language B and the pivot P), rather than needing a different translator for every possible combination of A and B. The disadvantage of a pivot language is that each step of retranslation introduces possible mistakes and ambiguities – using a pivot language involves two steps, rather than one. For example, when
Hernán Cortés Hernán Cortés de Monroy y Pizarro Altamirano, 1st Marquess of the Valley of Oaxaca (; ; 1485 – December 2, 1547) was a Spanish ''conquistador'' who led an expedition that caused the fall of the Aztec Empire and brought large portions of w ...
communicated with
Mesoamerican Mesoamerica is a historical region and cultural area in southern North America and most of Central America. It extends from approximately central Mexico through Belize, Guatemala, El Salvador, Honduras, Nicaragua, and northern Costa Rica. Withi ...
Indians, he spoke Spanish to
Gerónimo de Aguilar Jerónimo de Aguilar O.F.M. (1489–1531) was a Franciscan friar born in Écija, Spain. Aguilar was sent to Panama to serve as a missionary. He was later shipwrecked on the Yucatán Peninsula in 1511 and captured by the Maya. In 1519 Hernán Cor ...
, who spoke
Mayan Mayan most commonly refers to: * Maya peoples, various indigenous peoples of Mesoamerica and northern Central America * Maya civilization, pre-Columbian culture of Mesoamerica and northern Central America * Mayan languages, language family spoken ...
to
Malintzin Marina or Malintzin ( 1500 – 1529), more popularly known as La Malinche , a Nahua woman from the Mexican Gulf Coast, became known for contributing to the Spanish conquest of the Aztec Empire (1519–1521), by acting as an interpreter, advi ...
, who spoke
Nahuatl Nahuatl (; ), Aztec, or Mexicano is a language or, by some definitions, a group of languages of the Uto-Aztecan language family. Varieties of Nahuatl are spoken by about Nahua peoples, most of whom live mainly in Central Mexico and have smaller ...
to the locals.


Examples

English English usually refers to: * English language * English people English may also refer to: Peoples, culture, and language * ''English'', an adjective for something of, from, or related to England ** English national ide ...
, French,
Russian Russian(s) refers to anything related to Russia, including: *Russians (, ''russkiye''), an ethnic group of the East Slavic peoples, primarily living in Russia and neighboring countries *Rossiyane (), Russian language term for all citizens and peo ...
, and
Arabic Arabic (, ' ; , ' or ) is a Semitic languages, Semitic language spoken primarily across the Arab world.Semitic languages: an international handbook / edited by Stefan Weninger; in collaboration with Geoffrey Khan, Michael P. Streck, Janet C ...
are often used as pivot languages.
Interlingua Interlingua (; ISO 639 language codes ia, ina) is an international auxiliary language (IAL) developed between 1937 and 1951 by the American International Auxiliary Language Association (IALA). It ranks among the most widely used IALs and is t ...
has been used as a pivot language in international conferences and has been proposed as a pivot language for the
European Union The European Union (EU) is a supranational political and economic union of member states that are located primarily in Europe. The union has a total area of and an estimated total population of about 447million. The EU has often been des ...
.Breinstrup, Thomas. "Linguaphobos? Non in le UE". inguaphobes? Not in the EU ''
Panorama in Interlingua ''Panorama in Interlingua'' is the primary periodical for the language Interlingua, published bimonthly. It was first issued in January 1988. The magazine is based in Odense, Denmark, and is written completely in Interlingua and the activities of ...
'', 2006, Issue 5.
Esperanto Esperanto ( or ) is the world's most widely spoken constructed international auxiliary language. Created by the Warsaw-based ophthalmologist L. L. Zamenhof in 1887, it was intended to be a universal second language for international communi ...
was proposed as a pivot language in the
Distributed Language Translation Distributed Language Translation ( eo, Distribuita Lingvo-Tradukado, DLT) was a project to develop an interlingual machine translation system for twelve European languages. It ran between 1985 and 1990. :The distinctive feature of DLT was the use ...
project and has been used in this way in the Majstro Tradukvortaro at the Esperanto websit
Majstro.com
The
Universal Networking Language {{Advert, date=April 2021 Universal Networking Language (UNL) is a declarative formal language specifically designed to represent semantic data extracted from natural language texts. It can be used as a pivot language in interlingual machine tran ...
is an artificial language specifically designed for use as a pivot language.


In computing

Pivot coding is also a common method of translating data for computer systems. For example, the
internet protocol The Internet Protocol (IP) is the network layer communications protocol in the Internet protocol suite for relaying datagrams across network boundaries. Its routing function enables internetworking, and essentially establishes the Internet. IP h ...
,
XML Extensible Markup Language (XML) is a markup language and file format for storing, transmitting, and reconstructing arbitrary data. It defines a set of rules for encoding documents in a format that is both human-readable and machine-readable. T ...
and
high level language In computer science, a high-level programming language is a programming language with strong Abstraction (computer science), abstraction from the details of the computer. In contrast to low-level programming languages, it may use natural language ...
s are pivot codings of computer data which are then often rendered into internal binary formats for particular computer systems.
Unicode Unicode, formally The Unicode Standard,The formal version reference is is an information technology Technical standard, standard for the consistent character encoding, encoding, representation, and handling of Character (computing), text expre ...
was designed to be usable as a pivot coding between various major existing character encodings, though its widespread adoption as a coding in its own right has made this usage unimportant.


In machine translation

Current statistical machine translation ( SMT) systems use
parallel corpora A parallel text is a text placed alongside its translation or translations. Parallel text alignment is the identification of the corresponding sentences in both halves of the parallel text. The Loeb Classical Library and the Clay Sanskrit Libr ...
for source (s) and target (t) languages to achieve their good results, but good parallel corpora are not available for all languages. A pivot language (p) enables the bridge between two languages, to which existing parallel corpora are entirely or partially not yet at hand. Pivot translation can be problematic because of the potential lack of fidelity of the information forwarded in the use of different corpora. From the use of two bilingual corpora (s-p & p-t) to set up the s-t bridge, linguistic data are inevitably lost. Rule-based machine translation ( RBMT) helps the system rescue this information, so that the system does not rely entirely on statistics but also on structural linguistic information. Three basic techniques are used to employ pivot language in machine translation: (1) ''triangulation'', which focuses on phrase paralleling between source and pivot (s-p) and between pivot and target (p-t); (2) ''
transfer Transfer may refer to: Arts and media * ''Transfer'' (2010 film), a German science-fiction movie directed by Damir Lukacevic and starring Zana Marjanović * ''Transfer'' (1966 film), a short film * ''Transfer'' (journal), in management studies ...
'', which translates the whole sentence of the source language to a pivot language and then to the target language; and (3) ''synthesis'', which builds a corpus of its own for system training. The triangulation method (also called ''phrase table multiplication'') calculates the probability of both translation correspondences and lexical weight in s-p and p-t, to try to induce a new s-t phrase table. The transfer method (also called ''sentence translation strategy'') simply carries a straightforward translation of s into p and then another translation of p into t without using probabilistic tests (as in triangulation). The synthetic method uses an existing corpus of s and tries to build an own synthetic corpus out of it that is used by the system to train itself. Then a bilingual s-p corpus is synthesized to enable a p-t translation. A direct comparison between triangulation and transfer methods for SMT systems has shown that triangulation achieves much better results than transfer. All three pivot language techniques enhance the performance of SMT systems. However, the ''synthetic'' technique doesn't work well with RBMT, and systems' performances are lower than expected. Hybrid SMT/RBMT systems achieve better translation quality than strict-SMT systems that rely on bad parallel corpora. The key role of RBMT systems is that they help fill the gap left in the translation process of s-p → p-t, in the sense that these parallels are included in the SMT model for s-t.


References

* Hua Wu and Haifeng Wang. 2009
Revisiting Pivot Language Approach for Machine Translation
ACL-09. * Utiyama, M. & H. Isahara (2006
A comparison of pivot methods for phrase-based statistical machine translation
In Proceedings of NAACL/HLT, 484{491. {{DEFAULTSORT:Pivot Language Translation Constructed languages Language